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TATGCTCAAAAATTGTGTACCTTTAGCTTTTTAATTTGTAAAGGGGTTAATAAGGAATATTTGATGTATAGTGCCTTGAC 
ATACGAGTTTTTAACACATGGAAATCGAAAAATTAAACATTTCCCCAATTATTCCTTATAAACTACATATCACGGAACTG ?2 ° 
C S K i v Y L • L F H L • R G • • G I F D Y • C L D 
\ \ \\ l 0 C V V \\ <■_ < r °y K % \ », "« \ >■. " e % »„ T 
I 1 1 1 1 1 1 1 H 1 1 1 1 1 1 1 



H E F I T Y R • SKLKYLP-YP I NSTYHR<5 

■ S \ \ S 'c \ \ \ '» «, l , % 



V 
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BsaB I Dra I 

TAGAGATCATAATCACCCATACCAC^ 

ATCTCTAGTATTAGTCGGTATGGTGTAAACATCTCCAAAATGAACGAAATTTTTTGGAGGGTGTGGAGGGGGACTTGGAC 800 

*RS < S A IPHL • RFYLL • KTSHTSP • T ♦ 
f R D H N Q P Y H I C R G F T C F K K P P T P P P E P 
LEI I I SH TTFVEVLLALKNLPHLP L N L 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 

•L DYDAMGCKYLN • KS FVEWVEGQVQ 

K S t u L i \ G w Y « W v M « Q . L * P K V Q K L F G G v G G G S G S 
SIUI LWVVNTSTKSAKFFRGCRGRFR 

w . , Hinc II 

me f Hpa 1 

AAACATAAAATGAATGCAATTGTT GTTGTTAACTTGTTTATTGCAGCTTATAATGGTTACAAATAAAGCAATAGCATCAC 

J— — t 1 1 1 1 1 1 H 1 f— 1 1 1 1 1 \ 880 

TTTGTATTTTACTTACGTTAACAACAACAATTGAACAAATAACGTCGAATATTACCAATGTTTATTTCGTTATCGTAGTG 

n N v 1 K M *„ M „ Q * T L „ L „ L L T c L L Q L 1 M V T N K A I A S 
ET • NECNCCC • LVY CS L • WLQ I K Q • H H 

KH K M N A I V V V N L F f A A Y N G Y K • S N S I T 
I 1 1 1 1 1 1 1 H 1 1 1 1 1 1 1 1 *. 

F v K F * K C r N „ N „ N * N V K N C S I I T V F L A I A D C 

v Y n Y r F r K H . L , Q m Q « <L \ s T Q L K Y H N C I F C Y C • 

F C L IFAITTTLKNIAA - LP • LYLLLMV 

Xba 1 

AAATTTCACAAATAAAGCATTTTTTTCACTGCA 

TTTAAAGTGTTTATTTCGTAAAAAAAGTGACGTAAGATCAACACCAAACAGGTTTGAGTAGTTACATAGAATAGTACAGA 

K *«■ S « K 1 K o H , F « F « fl - C 1 L v v v c p H S S M Y L T if S 
K « F * H * K S . L F » F T A F ■ LWFVQTHQC I L S C L 
N F TNKAFFSLHSSCGLSKL I NVSYHV 

■ 1 1 1 1 1 1 1 i 1 1 h 1 1 1 1 »• 

I E C I FCKK • QURTTTQGFED IYR I k D 
K K K Y . L 4 M„ « K ? A M • M H M T W V • • H I K D H R 
FKVFLANKESCELQPKDLSMLTD - - t • 

Sph 1 
Nsi 1 

agatcttgtggaatgtgtgtcagttagggtgtggaaagtccccaccctccccagcaggcagaagtatgcaaagcatgcJt 
i 1 1 1 \ i i { 1 1 { 1 1 1 1 1 1 1040 

TCTAGAACACCTTACACACAGTCAATCCCACACCTTTCAGGGGTCCGAGGGGTCGTCCGTCTTCATACGTTTCGTACGTA 

RSCGMCVS GVESPQAPQQAEVCKACI 

D t L t V - K C « V S V R V ¥ K V P y R L P S *R Q K Y A K H A 
; I LWHVCQLGCGKSPGSPAGRSUQSMH 
J 1 1 1 1 1 1 H 1 1 1 1 1 1 1 h 

L c D n Q * P « ! « H « T * L . \ P . T . s L G w A G w c A S T H L A H U 
S t R u T « S c H * T „ D T „ K T „ " F T G L s G L L C F Y A F C A D 
IKHFTH • NPHPFDGPEGAPLL I CLIf C 



Ffg.JSC 
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)h I 
Isi I 

CTCAATTAGTCAGCAACCAGGTGTGGAAAGTCCCCAGGCTCCCCAGCAGGCAGAAGTATGCAAAGCATGCATCTCAATTA _ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 y 1120 

GAGTTAATCAGTCGTTGGTCCACACCTTTCAGGGGTCCGAGGGGTCGTCCGTCTTCATACGTTTCGTACGTAGAGTTAAT 

S I SQQPGYESPQAPQQAEVCKAC I S I 
SQLVSNQVWKVPRLPSRQKYAKHASQL 
LN • SATRCGKSPGSPAGRSMQSMHLN - 

I 1 I 1 1 \ 1 1 1 \ \ 1 1 1 1 1 1- 

EIL CGPTSLGWAGWCASTHLAHME I L 

• NTLVWTHFTGLSGLLCFYAFCAD N 
RL • DALLHPFDGPEGAPLL I CLMCRL 



Nco I 
pty I 



GTCAGCAACCATAGTCCCGCCCCTAACTCCGCCCATCCCGCCCCTAACTCCGCCCAGTTCCGCCCATTCTCCGCCCCATG 

1 t 1 1 1 1 1 1 1 1 1 I 1 1 1 1- 1200 

CAGTCGTTGGTATCAGGGCGGGGATTGAGGCGGGTAGGGCGGGGATTGAGGCGGGTCAAGGCGGGTAAGAGGCGGGGTAC 

SQQP SRP LRPSRP LRPVPPILRPM 
VSNHSPAPNSAHPAPNSAQFRPFSAPW 
SATIVPPLTPPIPfLTPPSSAHSPPH 

I I 1 1 1 1 1 1 1 1 1 1 1 1 1 Y 

• CGYDRG • SRGDRft • SRGTGGMRRGM 
TLVWLGAGLEAWGAGLEAWNRGNEAGH 
DALMTGGRVGGMGGRVGGLEAWEGGWP 



Hae m Hae fflP gI jHae ffl 

GCTGACTAATTTTTTTTATTTATGCAGAGGCCGAGGCCGCCTCGGCCTCTGAGCTATTCCAGAAGTAGTGAGGAGGCTTT „ „ 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 V 1280 

CGACTGATTAAAAAAAATAAATACGTCTCCGGCTCCGGCGGAGCCGGAGACTCGATAAGGTCTTCATCACTCCTCCGAAA 

AD • FFLFMQRP RPPRPLSYSRSSEEAF 
LTNFFYLCRGRGRLGL * A I PEVVRRL 
G • L I FFIY'AEAEAASASELFQK* G G F 

I 1 1 \ 1 \ 1 1 1 I 1 1 I 1 I 1 \r 

AS • NKKNI CLGLGGRGRL • ELLLSSAK 
SVLKK - KHLPRPRRPRQAIGSTTLLSK 
QSIKKl ASASAAEAESSNWFYHPPK 

Hae m 
Stu I 

Ayr E Ava I 



TTTGGA 



Sty I Xho I 

SGCCTAGGCTTTTGCAAAAAGCTCCCTCGAGAG 



CCT AGGCT TT TGCA A AA AGCTCCCf CGAG AGCT TGGCGT A AT C ATGGTC AT AGCTGT TT CC TGTGTG A AAT T m 



I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 U360 

AAACCTCCGGATCCGAAAACGTTTTTCGAGGGAGCTCTCGAACCGCATTAGTACCAGTATCGACAAAGGACACACTTTAA 

LEA • AFAKSSLESLA* SWS * LFPV * N 
FWRPRLLQKAPSRAWRNHGHSCFLCE I 
FGGLGFCKKLPRELGVIMV I AVSCV KL 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 V 

K S A ■ AKAFLERSLKAYDHDYSNGTHFQ 
QLGLSKCFAGELA QRL • P • LQKRHS I 
KPPRPKQLFSGRSSPT IMTMATEQTFN 
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ps: 



rb I 



CTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAA 

1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1- 1440 

CAATAGGCGAGTGTTAAGGTGTGTTGTATGCTCGGCCTTCGTATTTCACATTTDGGACCCCACGGATTACTCACTCGATT 

CYPLTIPHNIRAGSIKCKAWGA- • V S • 
VIRSQFHTTYEPEA • SVKPGVPNE • AN 

LSAHNSTQHTSRKHKV SLGCLMSEL 
I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 H 

• CSV I G C LMRAPLMFHLAQPA ■ HTL • 
T IRECNWVVYSGSAYLTFGPTGLSHA I 

NDA • LEVCCVLRFCLTYLRPHR I LSSV 

jAsel Pvu II Asel Hae m 

CTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCA 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I- 1520 

GAGTGTAATTAACGCAACGCGAGTGACGGGCGAAAGGTCAGCCCTTTGGACAGCACGGTCGACGTAATTACTTAGCCGGT 

LTL I ALRSLPAFQSGNLSCQ LH • I G Q 

S H LRCAHCPLSSRETCRASCINESA 
THI NCVALTARFPVvGKPVVPAALMNRP 

I 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1- 

SVN I ANRESGAKWD'PFRDHWSC ■ HI PW 

EC N R Q A QGSELRSVQRALQMLSDAL 

• MLQTASVARKGTPFGTTGAAN I FRG 



Sap I 



ACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGC _ 

1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 h 1600 

TGCGCGCCCCTCTCCGCCAAACGCATAACCCGCGAGAAGGCGAAGGAGCGAGTGACTGAGCGACGCGAGCCAGCAAGCCG 

RAGRGGLRIGRSSASSLTDSLRSVVR 
NARGEAVCV LGALPLPRSLTRCARSFG 
TRGERRFAYWALFRFLAH LAALGRSA 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 

RAPLPPKRIPREE AEESVSESRETTRS 
ARPSATQTNPARGSGRESVRQARDNP 
VRPSLRNAYQASKRKRA • QSAASPREA 



BsrB I 

TGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGT AATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATG 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 1680 

ACGCCGCTCGCCATAGTCGAGTGAGTTTCCGCCATTATGCCAATAGGTGTCTTAGTCCCCTATTGCGTCCTTTCTTGTAC 

LRRAVSAHSKAV I RLSTESGDNAGKNM 
CGERYQLTQRR • YGYPQNQG I TQERTC 
AASGISSLKGGNTVIHRIRG RRKEH 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 k 

RRATDA • EFAT IRNDVSDPSLAPFFM 
QP SRY SV LRYYP GCF PIVCSLVH 

AALPILESLPPLVTIWLILPYRLFSCT 
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jHae m pae m ^ae m 



TGAGCAAAAGGCCACCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGA 

1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1- 1760 

ACTCGTTTTCCGGTCGTTTTCCGGTCCTTGGCATTTTTCCGGCGCAACGACCGCAAAAAGGTATCCGAGGCGGGGGGACT 

• AKGQQKARNRKKAALLAFFHRLRPPD 
EQKASKRPGTVKRPRCWRFS IGSAPL 
VSKRPAKGQEP • KGRVAGVFP * APPP • 
I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 h 

HAFPWCFALFRLFAANSANKWLSRGGS 
SCFALLLGPVTFLGRQQRKEMPEAGRV 
LLLGAFPWSGYFPRTAPTKGYAGGGQ 

CGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1840 

GCTCGTAGTGTTTTTAGCTGCGAGTTCAGTCTCCACCGCTTTGGGCTGTCCTGATATTTCTATGGTCCGCAAAGGGGGAC 

EHHKNRRSSQRWRNPTGL • RYQAFPP 
TS I TK I DAQVRGGETRQDYKDTRRFPL 
RASQKSTLKSEVAKPDRT IKIPGVSPW 

I 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 b 

SC • LFRREL • LHRF GVPSYLYWANGGP 
LMVFISA • TLPPS VRCS • LSVLRKGR 
RADCFDVSLDSTAFGSLVI F IGPTEGQ 

GAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACGGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTG _ 

\ 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 1- 1920 

CTTCGAGGGAGCACGCGAGAGGACAAGGGTGGGACGGCGAATGGCCTATGGACAGGCGGAAAGAGGGAAGCCCTTCGCAC 

GSSLVRSPVPTLPLTGYLSAFLPSGSV 
EAPSCALLFRPCRLPDTCPPFSLREAW 
KLPRALSCSDPAAYRIPVRLSPFGKR 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1- 

LERTRECTGVRGSVPYRDAKRGEPLT 
SAGEHARRN RGQRKGSVQGGKERRSAH 
FSGRASEQESGAA RIGTRREGKPFRP 



ApaL I 



GCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 2000 

CGCGAAAGAGTTACGAGTGCGACATCCATAGAGTCAAGCCACATCCAGCAAGCGAGGTTCGACCCGACACACGTGCTTGG 

ALSQCS RCRYLSSV • VVRSKLGCVHEP 
RFLNAHAVG I SVRCRSFAPSWAYCTM 
GAFSMLTL • VSQFGVGRSLQAGLCART 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 y 

ASE * HERQLYRLETYTTRELSPQTCSG 
RKRLA - ATPI ETRHLDKAGLQATHVFG 
AKEI SVSYTD • NPTPRESWAPSHARV 



ci I 



CCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCAC 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 tZOBO 

GGGGCAAGTCGGGCTGGCGACGCGGAATAGGCCATTGATAGCAGAACTCAGGTTGGGCCATTCTGTGCTGAATAGCGGTG 

PVQPDRCALSGNYRLESNPVRHDLSP 
PPFSPTAAPYPVTIVLSPTR - DTT YRH 
PRSARFLRLIR* LSS • VQPGKTRLIAT 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 

GT GSRQAKDPL ■ RRSDLGTLCSKDGS 
G KLGVAAG • GTY I TKLGVRYSVV RW 
GREARGSRRI RYSDDQTWGPLVRS I A V 
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e m 



TGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGACTTCTTGAAGTGGTGGCCTAAC 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 2160 

ACCGTCGTCGGTGACCATTGTCCTAATCGTCTCGCTCCATACATCCGCCACGATGTCTCAAGAACTTCACCACCGGATTG 

LAAATGHR ISRARYVGGATEFLKWWPN 
WQQPLVTGLAERGM • AVLQSS • SGGLT 
GSSHV • QD • QSEVCRRCYRVLEVVA 

i 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 v 

AAAVPLL I LLALYTPPAVSNKFHHGL 
QCCGSTVPNASRP I YATSCLEQLPPRV 
PLLWQYCS • CLSTHLRH • LTRSTTA • S 

TACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTC 
1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 h 2240 

ATGCCGATGTGATCTTCCTGTCATAAACCATAGACGCGAGACGACTTCGGTCAATGGAAGCCTTTTTCTCAACCATCGAG 

YGYTRRTVFGI CALLKPVTFGKRVGSS 
TATLEGQYLVSALC ■ SQLPSEKELVA 
LRLH - KDS IWYLRSAEASYLRKKSW L 
I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 

* P • VLLVTNP I QAR«SFGTVKPFLTPLE 
VAVSSPCYKTDASQ, QLWNGESFSNTAR 
RSC • FSL I QYRREASAL • RRFFLQYS 

TTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGAT 
h- 1 1 i 1 1 1 1 ■ — i 1 1 H 1 I 1 1 2320 

AACTAGGCCGTTTGTTTGGTGGCGACCATCGCCACCAAAAAAACAAACGTTCGTCGTCTAATGCGCGTCTTTTTTTCCTA 

• SGKQTTAGSGGFFVCKQQ ITRRKKG 
LDPANKPPLVAVVFLFASSRLRAEKKD 
L IRQTNHRW - RWFFCLQAADYAQKKR I 

I 1 1 1 1 1 I 1 1 1 1 1 1 1 I 1 h 

QDPLCVVAPLP. PKKTQLCC I VRLFFPD 
SGAFLGGST ATTKKNALLLNRASFFS 
K I RCVFWRQ YRHNKQKCAAS ACFFL I 



BspH I 



CTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATG „ _ M 

1 1 1 1 1 I 1 1 1 1 1 I 1 1 1 1- 2400 

GAGTTCTTCTAGGAAACTAGAAAAGATGCCCCAGACTGCGAGTCACCTTGCTTTTGAGTGCAATTCCCTAAAACCAGTAC 

SQEDPL1FSTGSDAQWNENSR • GI LVlt 
LKKI L • SFLR GLTLSGTKTHVKGF WS • 
SRRSFDLF YGV • RSVERKLTLRDFGH 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 

•SSGKIKEVPDSA • HFSFER • P I KTU 
RLF IRQQKRRPRVSLPVFV TLPKQDH 
ELLDKSRK • P TQRETSRFSVNLSK P • S 



jDra I Dra I 



AGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTA A 

I 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 *2480 

TCTAATAGTTTTTCCTAGAAGTGGATCTAGGAAAATTTAATTTTTACTTCAAAATTTAGTTAGATTTCATATATACTCAT 

SKR j FT • I LLK • K • SFKS I SIYE- 
DYQKGSSPRSF IKNEVLNQSKVYMS 
EI I KKDLHLDPFKLKICKF INLKY I • V 

I 1 1 1 1 1 1 1 1 1 I i 1 1 1 : — I ¥ 

L NDFL I KV IRKF • FHLKLDI • LIYSY 
S • FPDEGLDK 1LFSTKF DLTY1LL 

I I LFSR* RSGKLHFI FN - ILRFYIHT 
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AACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTG 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 H 2560 

TTGAACCAGACTGTCAATGGTTACGAATTAGTCACTCCGTGGATAGAGTCGCTAGACAGATAAAGCAAGTAGGTATCAAC 

TWSDSYQCLISEAPISAICLFRSSIV 
KLGLTVTNA - SVRHLSQRSVYFVHP L 
NLV • QLPMLNQ • GTYLSDLS I SF IHSC 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 H 

VQDSL WHKl LSAGI EAIQRNREDMTA 

SPRVTVLA DTLCRD RDT KT GYN 

FKTQCHG I S L * HPV - R LSRD I ENMWLQ 



ae m 



CCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGAC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1- 2640 

GGACTGAGGGGCAGCACATCTATTGATGCTATGCCCTCCCGAATGGTAGACCGGGGTCACGACGTTACTATGGCGCTCTG 

A • L P V V • ITTIREGLPSGPSAAMIPRD 
PDSPSCR* LRYGRAYHLAP VLQ YRET 
LTPRRVDNYDTGGLT IWPQCCNDTAR 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 h 

QSGTTYIVVIRSPKGDPGLAAI IGRS 
GSEGDHLYSRYPLA • WRAGTSCHYRSV 
RVGRRTSL SVPPSVMQGWHQLSVALG 



3gll 

pae m ^va II 



CCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTT MM 

1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 h 2720 

GGTGCGAGTGGCCGAGGTCTAAATAGTCGTTATTTGGTCGGTCGGCCTTCCCGGCTCGCGTCTTCACCAGGACGTTGAAA 

PRSPAPDLSAINQPAGRAERRSGPATL 
HAHRLQIYQQ - TSQPEGPSAEVVLQL 
PTLTGSRF I SNKPASRKGRAQKWSCNF 

I 1 1 1 1 1 1 ■ — I 1 1 1 1 1 1 1 1 h 

GREGAGSKDAIFWGAPLASRLLPGAVK 
WA RSWI • CYVLWGSPGLASTTRCS 
VSVPELN I LLLGALRFPRACF-HDQLK 

Asel Nci I Fsp I 

ATCCGCCTCCATCCAGTCT|TTAATTGTTGCCGGGAAGCTAGAGTAAG g800 

TAGGCGGAGGTAGGTCAGATAATTAACAACGGCCCTTCGATCTCATTCATCAAGCGGTCAATTATCAAACGCGTTGCAAC 

SAS I QS I NCCREARVSSSPVNSLRNV 
YPPPS S LL IVAGKLE • VVRQLIVCATL 
IRLHPVY'LLPGS-SK-FAS- FAQRC 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 V 

DAEUWD I LQQRSALTLLEGTLLKRLTT 

GGGDLRNITAPFSSYTTRWNI TQAVN 
IRRWGT- NNGPL * LLYNAL * YNACRQ 

TTGCCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGG OOOA 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I- 2880 

AACGGTAACGATGTCCGTAGCACCACAGTGCGAGCAGCAAACCATACCGAAGTAAGTCGAGGCCAAGGGTTGCTAGTTCC 

VAIATGIVVSRSSFGUASFSSGSQRSR 
LPLLQASWCHARRLVWLHSAPVPNDQG 
CHCYRHRGVTLVV WYGFIQLRFPT IK 

I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 — i 1- 

AlfAVPUTTDREDNP I AENLEPEWRDL 
NGNSCADHH * ARRKTHS • EAGTGLS • P 
Q W Q • LCRPTVSTTQYPKM - SRNGVI LA 
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Ava n Pvu I Hae IH 

CCAGTTACATGATCCCCCATGT TGTGCAAAAAAGCGGTTAGCTCCTTCGOTCCTCCGATCGTTGTCAGAAGTAAOTTrJr 

■ I 1 1 I I 1 | \ 1 1 | I I | | pQfiA 

GCTCAATGTACTAGGGGGTACAACACGTTTTTTCGCCAATCGAGGAAGCCAGGAGGCTAGCAACAGTCTTCATTCAACCG 

1 1 ' 1 1 1 1 I 1 1 1 1 1 1 1 1 L 

RT VHDGMNHLFAT L EKPGG I TTLLLNA 

S i N C u S t G n G w K Q * A „ P c F R N A G E T R R D M D S T L Q A G 
L * MICWTTCFLP • SRRDESRQ - F Y T P 

CGCAGTGTTATCACTCATGGTTATG GCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTCA 

1 1 1 1 1 1 I I I I I I I I I i **040 

GCGTCACAATAGTGAGTACCAATACCGTCGTGACGTATTAAGAGAATGACAGTACGGTAGGCATTCTACGAAAAGACACT 

« A « V „ k, S LMVMAALHNS LTVMPSVRC FS V 
P « Q « c Y H s w L W Q H C I I L L L S C H P ■ D A F L • 
.R S V 1 T, H G Y G S T A • F S Y C H A I R K M L F C D 

1 1 1 1 1 1 1 1 f— — I 1 1 1 1 1 1 h 

A T NDS MT IAASCLERVTMGDTLHKETV 
R L T I ' V • »P • \ C L C V \ \ < N R E K - S Q °. \ \ % \ \\\\\ 

Rsa I 

pea I Nci I Hinc H 

CTGGTGAGTACTCAACCAAGTCATTCT GAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAACACGGGAT 

1 1 1 I I \ | | | | — ■ - | | | I i I 3120 

GACCACTCATGAGTTGGTTCAGTAAGACTCTTATCACATACGCCGCTGGCTCAACGAGAACGGGCCGCAGTTGTGCCCTA 
T GEYSTKSF * E * ClfRRPSCSCPASTRn 

L w v S v T i % P o K H , S t E o N t K\ c g d r v a t A R R Q h R g D i 
. * ' v . LNQVILRIVY-AATELLLPGVNTG 
1 " ' 1 1 1 1 1 1 1— 1 1 1 1 1 1 1 1- 

c P * S r Y « E V „ L , D N „ Q « s « Y » 1 R R G L Q E Q G A D V R S 
T L V • G L E S F L T H P S R T A R A R R - C P I 
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ACCGCTGTTGAGATCCAGTTCG ATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCCTTT 
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